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WALLACE'S  WEAK  MEAN  SQUARE  ERROR  CRITERION 
FOR  TESTING  LINEAR  RESTRICTIONS  IN  REGRESSION: 
A  TIGHTER  BOUND 

T.A.  Yancey,  G.G.  Judge  and  M.E.  Bock 


A  sharper  set  of  sufficient  conditions  for  the  criterion 
used  for  the  weak  mean  square  error  test  of  Wallace  is 
established  to  replace  the  sufficient  conditions  of  his 
development . 

1.  Introduction 


In  a  recent  issue  of  this  Journal,  Wallace  [4],  building  on  some  of  his 
earlier  work  [1,3],  with  the  K  parameter  classical  linear  statistical  model 

(1.1)  y  =  X§  +  e, 

under  J  linear  restrictions  or  hypotheses 

(1.2)  RB  -  r  =  0, 

presents  a  test  for  determining  when  the  difference  of  the  risk  functions  of 
the  least  squares  unrestricted  and  restricted  estimators  is  positive.  Wallace's 
sufficient  condition  for  the  risk  of  the  restricted  estimator,  E(B-3)'(3-6)  to  be 
at  least  as  small  as  that  of  the  unrestricted  estimator,  E(b-3)'(b-3)  is  that  the 
magnitude  of  the  specific  errors,  5,  in  the  null  hypotheses,  (1.2),  be  such  that 

(1.3)  ^  ^  J  ^k  tJ^S"-^R'(RS"-^R')"-'RS"-^  =  ^0  •' 


Where  pj^  is  the  smallest  characteristic  root  of  S=X'X.  The  parameter  X  equals 
(RB-r)'(RS"'^R')*'^(RB-r)/2a^  which  equals  6 '  (RS'^^R')  ""^(5/20^  and  is  the  noncen- 


trality  parameter  in  the  F,.  ^  „  „.  distribution  of  the  test  statistic,  u  = 

(.A  ,J  ,  1  -Nj 


(Rb-r)'(RS"-^R')"-^(Rb-r)/Ja^ 
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The  purpose  of  the  present  paper  is  to  develop  a  sharper  set  of  conditions 
for  determining  (i)  where  the  risk  function  of  the  restricted  estimator  is  less 
than  that  of  the  conventional  estimator,  (ii)  when  it  is  impossible  to  determine 
which  risk  function  is  smaller,  and  (iii)  when  the  risk  function  of  the  con- 
ventional estimator  is  less  than  that  of  the  restricted  estimator. 

A  comparison  of  the  results  of  Wallace  and  those  of  this  paper  appears  in 
Section  2,  and  Section  3  summarizes  our  results  and  indicated  directions  for 
further  work. 

2.   Reparameterization  of  the  Model 

Repararaeterization  of  the  model,  (1.1)  and  (.1.2),  is  a  convenient  device 

for  clarifying  the  behavior  of  the  risk  function  of  3, the  restricted  estimator. 

Using  a  non-singular  matrix  W  =  QP  such  that  WB  =  6  and  Z  =  XP  Q'the  original 
model,,  (1.1)  and  (1.2),  becomes 

(2.1)  y  =  Ze  +  e 
and 

(2.2)  [IjO]e  -  r^  =  0 

-1 '      -1 
where  P  is  such  that  P   (X'X)  P   =  1  ,  Q  is  an  orthogonal  matrix  such  that 

QBQ'=  \p«Jo/,  since  B  =  P"-^ 'r' (RS'-^R')  "■^RP'"-^  is  idempotent,  and  r  =  G  "^r 
from  RP'^^Q-QP'^^'r  =  RS'^^R'  =  GjGj'  where  RP~"^Q'  =  [GO].  Of  course,  Z'Z  =  I„ 
and  any  estimator  6  implies  an  estimator  W  9=3  for  3- 

The  least  squares  estimator,  restricted  estimator,  test  statistic,  and 
non-centrality  of  the  reparametrized  model  are  |  =  Z'j^  '\j  N(6,o  Ij,) , 

Cs  -r  ) ' (ft  -r  ) 


\ 


respectively,  where  9     =   [O  I„  ,]  9. 

The  weighted  risk  functions  for  the  least  squares  unrestricted  and  re- 
stricted estimators,  which  are  the  same  as  the  unweighted  risk  function  of 
b  and  3,  are 

(2.3)  p(b,3)  =  E(b-B)'(b-B)  =  £(6-9)  •W"'-^'w"-^(9-e3  =  aHrW"-^'w"-^  =  a^trS"^ 

=  oHn     d.  +  S    d. 
\i=l  ^   i=J+l 

where  d.,*«',  d  and  d^  ■,>'">  '^v   ^'^^  characteristic  roots  of  A  =  [l_  0]  W~  W~  [I,  0] ' 

and  [0  Ij,  ]  W'^  W"-^[0  Ij^_j]'  respectively,  and 

(2.4)  p(6.B)  =  E(B-B)'(B-B)  =  E(6-8) 'W'-^'w"-^(6-6) 

=  E(6-6)'W"^'w"^(9-6)  +  E(r^-9j)'A(r^-ej)  -  £(6^-6^) 'A(ej-ej) . 

In  order  to  determine  conditions  under  which  the  risk  function  for  b  exceeds 
that'  for  3,  we  subtract  (2.4)  from  (2.3)  giving 

(2.5)  E(b-B)'(b-§)  -  E(3-3)'(0-3)  =  -  (ej-r^)'A(9j-r^)  +  E(6j-ej)'A(ej-ej) 
which  is  non-negative  if  and  only  if 

(2.6)  £(6^-6^)  •A(9j-9p  =  o^rA  >  (B^-r^)  'A(9^^-r^)  . 

From  a  theorem  on  extrema  of  quadratic  forms  [2,p.5l], 
Sup      ^§J-?o^ '^^gj-^o^  _  ^ 

(Qj-To^  (Qj-ro^'tej-lo)  '  ^ 

where  d.  is  the  largest  characteristic  root  of  A,  we  have 

(2.7)     aHrA.  (8j-r^)-A(6j-r^)  <   dj^(9^-r^)  •  (9j-r^)  =  dj^2Xa^ 
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Furthermore,  S'^^R' (RS"'^R')'"^RS"'^  =  P"'^Q'QP"'^ 'r' (RS"^R ')  ^RP~'^Q'QP'''- ' 

=  yi'^ilj   0]'[lj  0]W"-^',  S"-^  =  P'-^Q'QP"-^'  =  W'-^W"-^',  trW"-^[lj  0]'[lj  0]W"'^' 

-1      -1*  -1  -1 

=  trA,  and  trS   =  trW   W  .  The  non-zero  characteristic  roots  of  W  [l,  0] 


[Ij  o]w"^' 


0^0 


w-^'w-^ 


J„  and  A  are  the  same  as  are  the  characteristic  roots 


of  S~  and  W"  K   since  the  order  of  matrices  does  not  affect  the  values  of  the 


characteristic  roots  of  their  product.   It  is  easy  to  see  that  d^   is  less  than 

-1'  -1 

x'W   W  X 

the  largest  characteristic  root  of  S'   ,   ^^^  - — 77-; '-—    for  all  x,  since 


x'x 


(x«  0')W"-^'w'-^(x  0) 
Sup^_Z_i! — - — __- — I^ —  =  d,  ,  restricting  x„  ^  to  be  0,  must  be  at  least  as 

small  as  the  supremum  for  all  x.  Hence,  the  largest  root  of  S   is  greater  than 
or  equal  to  that  of  S'^^R' (RS"-'-R')"'^RS"  ,  which  is  d^^. 

Thus  a  sufficient  condition,  and  the  smallest  one  that  will  always  hold, 
for  the  risk  of  b  to  be  at  least  as  large  as  B  is  that 

(2.8)     X  ^  ^  trA  =  ^  trS"-^R'(RS~-^R')"-^RS"-^. 

This  expression  is  the  same  as  (1.3)  with  \i,  which  is  the  reciprocal  of  the 
largest  root  of  S  ,  replaced  by  1/d.  so  that  the  bound  obtained  for  \  must 
be  greater  than  or  equal  to  that  in  t  le  Wallace  result. 

To  explore  the  composition  of  the  size  of  the  two  risk  functions  (2.5), 
we  use  an  orthogonal  transformation  C  to  obtain  the  characteristic  roots  of  A  so 


(2. 5a)    E(b-$)'Cb-B)  -  E(3-&)'C6-B)  =  oHrk   -  (S.-r^) 'C'CAC'CCe.-r J 


ii.  -i 


=  a^trA  -  C'DC  =  a^^  X  d.  -  E  ?  .M. 

-  -     •  1  1  -ill 
1=1    1=1 

where  D  is  a  diagonal  matrix  whose  elements  are  the  characteristic  roots,  d.,of 
A  and  |  is  a  vector  incorporating  the  specification  error  in  the  exact  restric- 
tions. One  can  perform  the  conceptual  experiment  of  varying  the  values  of  the 

elements  of  f  so  that  only  one  of  them  is  non  zero  while  X  =  x   remains  constant, 

20^ 


•t 
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thus  2Xa^  -  ^i'   ^"'^ 


(2.5b).  E(b-3)'Cb-3)  -  E(6-grCS-6)<  a^  Z  d.   -  2Xo^d, 

5=1  ^      . ^ 


Since  we  never  know  the  values  fi   £.,  the  risk  function  of  b  is  greater  than 

equal -to  that  of  3  if  X  <  T.  d./d     and  without  information  on  the  values  of  5  no 

i=l 
tighter  bound  is  possible.  A  similar  conceptual  experiment  associated  the  smallest 

characteristic  root  of  A,  d  ,  shows  that  the  risk  of  B  must  exceed  that  of  b 

J 
whenever  X  <  E  d./2d  .  Nothing  can  be  said  in  a  given  problem  situation 
~  i=l  ^   ^ 

Id.  Zd. 

about  which  risk  function  is  larger  when  - —  <  X  < 


2dt 


2d. 


For  the  risk  function  of  B>  the  inequality 


(2.9) 


E       d.+2Xd 
.    ,  ,    1         s 
j.=J+l 


<  E(6-6)'($-6)  i.  a'' 


Y.       d.+2Xd, 

•Til  L 

i=J+l 


holds  for  each  value  of  X  with  the  equalities  holding  at  X=0  where  the  risk 
function  of  §  is  smallest.  The  terms  in  (2.9)  are  also  equal  if  all  the  roots 
of  S  are  equal  as  is  the  case  for  an  idempotent  matrix.  Furthermore,  since  R 
and  S  are  known,  the  characteristic  roots  of  S~  R'(RS~  R')~  RS"  are  easy  to 
obtain. 

5.  Concluding  Remarks 


By  repararaeterizing  the  model,  a  sharper  result  has  been  given  for  com- 
paring the  risks  of  the  restricted  and  conventional  least  squares  estimators. 
The  relation  of  the  risk  functions  of  the  least  squares  unrestricted  and  restricted 
estimators  have  been  shown  to  involve  a  region  for  X  when  p(b,B)  >  p(B,3),  a 


region  of  A  where  uncertainty  exists  about  the  comparison  of  pCb.B)  and  p(3,3)» 
and  a  region  for  X  where  pCb,6)  <   p(3,3).  Without  knowing  the  values  of  ^  there 
is  no  way  to  reduce  the  region  of  uncertainty  further.  The  use  of  minimum  risk 
as  a  criterion  for  the  choice  of  an  estimator  is  not  new,  nor  is  the  abandonment 
of  unbiasedness,  but  these  changes  in  criteria  have  not  been  exploited  very 
systematically  in  applied  econometric  work.  The  work  of  Wallace  [4]  and  this  ex- 
tension provide  a  systematic  basis  for  determining  the  statistical  consequences 
in  a  risk  context  of  making  use  of  each  of  the  estimators. 

University  of  Illinois  at  Urbana -Champaign 
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